Exploiting Punctuation Semantics in Continuous Data Streams
Abstract
As most current query processing architectures are already pipelined, it seems logical to apply them to data streams. However, two classes of query operators are impractical for processing long or infinite data streams. Unbounded stateful operators maintain state with no upper bound in size and so eventually run out of memory. Blocking operators read an entire input before emitting a single output and so might never produce a result. We believe that a priori knowledge of a data stream can permit the use of such operators in some cases. We discuss a kind of stream semantics called punctuated streams. Punctuations in a stream mark the end of substreams, allowing us to view an infinite stream as a mixture of finite streams. We introduce three kinds of invariants to specify the proper behavior of operators in the presence of punctuation. Pass invariants define when results can be passed on. Keep invariants define what must be kept in local state to continue successful operation. Propagation invariants define when punctuation can be passed on. We report on our initial implementation and show a strategy for proving that implementations of these invariants are faithful to their relational counterparts.
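The three invariants can be illustrated with a minimal Python sketch of a grouped count, an operator that is both blocking and unbounded in state on an ordinary infinite stream. Everything named here is an assumption made for illustration and is not taken from the paper: the `Record` and `Punctuation` classes, the `punctuated_count` function, and in particular the simplified form of punctuation, which only promises that no later record will carry a given key.

```python
from dataclasses import dataclass
from typing import Dict, Hashable, Iterable, Iterator, Union


@dataclass(frozen=True)
class Record:
    """Hypothetical stream element: a grouping key plus an integer field."""
    key: Hashable
    value: int


@dataclass(frozen=True)
class Punctuation:
    """Hypothetical punctuation: promises that no later Record has this key."""
    key: Hashable


Item = Union[Record, Punctuation]


def punctuated_count(stream: Iterable[Item]) -> Iterator[Item]:
    """Count records per key, using punctuation to unblock and to purge state.

    Output records reuse Record, with `value` holding the count.
    """
    counts: Dict[Hashable, int] = {}
    for item in stream:
        if isinstance(item, Punctuation):
            # Pass: the group is complete, so its count can be emitted.
            # Keep: pop() discards state that can never change again.
            if item.key in counts:
                yield Record(item.key, counts.pop(item.key))
            # Propagation: the result for this key is already out, so the
            # punctuation can be forwarded to downstream operators.
            yield item
        else:
            counts[item.key] = counts.get(item.key, 0) + 1


demo = [
    Record("a", 1), Record("b", 2), Record("a", 3),
    Punctuation("a"),
    Record("b", 4),
    Punctuation("b"),
]
result = list(punctuated_count(demo))
```

In this sketch the state held at any moment covers only keys that have not yet been punctuated, so memory stays bounded as long as punctuations keep arriving, and each group's count is emitted without waiting for the end of the stream.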
Similar resources
Analysing Temporal Relations – Beyond Windows, Frames and Predicates
This article proposes an approach that relies on the standard operators of relational algebra (including grouping and aggregation) for processing complex events without requiring window specifications. In this way the approach can process complex event queries of the kind encountered in applications such as emergency management in metro networks. This article presents Temporal Stream Algebra (TSA) w...
Exploiting Punctuation Semantics in Data Streams
Applications that process data streams are becoming common: financial applications process streams of stock ticker data; telephone network monitoring applications process streams of call data. These applications often are queries over streams, so it seems natural to use a database management system instead of a custom application. However, some traditional relational operators are not conducive...
Semantics of Data Streams and Operators
What does a data stream mean? Much of the extensive work on query operators and query processing for data streams has proceeded without the benefit of an answer to this question. While such imprecision may be tolerable when dealing with simple cases, such as flat data, guaranteed physical order and element-wise operations, it can lead to ambiguities when dealing with nested data, disordered str...
An Abstract Semantics and Concrete Language for Continuous Queries over Streams and Relations
Despite the recent surge of research in query processing over data streams, little attention has been devoted to defining precise semantics for continuous queries over streams. We first present an abstract semantics based on several building blocks: formal definitions for streams and relations, mappings among them, and any relational query language. From these basics we define a precise interpr...
Journal title:
- IEEE Trans. Knowl. Data Eng.
Volume 15, Issue
Pages -
Publication date: 2003